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^ ■ Abstract: In this paper, we study the optimal control problems of 

Q ! controlled time-symmetric forward-backward stochastic differential equa- 

^ I tions with initial-terminal sate constraints. Applying the terminal pertur- 

•^ I bation method and Ekeland's variation principle, a necessary condition of 

Q ' the stochastic optimal control i.e. stochastic maximum principle is derived. 

Applications to backward doubly stochastic linear-quadratic control models 
as well as other specific models are investigated. 
k>( I Keywords: Time-symmetric forward-backward stochastic differential 

H I equations, Ekeland's variation principle. State constraints. Stochastic max- 

imum principle. 



1. Introduction 

It is well known that a general coupled forward-backward stochastic 
differential equations (FBSDEs in short) consists of a forward SDE of Ito 
type and a backward SDE of Pardoux-Peng's (for detail see [5], [16]). Since 
Antonelli [1] first studied FBSDEs in early 1990s, FBSDEs have been deeply 
studied in many papers (see [8], [13-14], [19] for details). 



In order to produce a probabilistic representation of certain quasilinear 
stochastic partial differential equations (SPDEs), Pardoux and Peng [17] 
first introduced a class of stochastic differential equations, i.e. backward 
doubly stochastic differential equations (BDSDEs for short) and proved the 
existence and uniqueness theorem of BDSDEs. Using such BDSDEs they 
proved the existence and uniqueness theorem of those quasilinear SPDEs 
and thus significantly extended the famous Feynman-Kac formula for such 
SPDEs. 

Connecting the theory of FBSDEs and BDSDEs, Peng and Shi [18] 
studied the following time-symmetric forward-backward stochastic differen- 
tial equations: 

-dxt = F{t, xt, zt, yt, qt, Ut)dt + G{t, xt, zt, yt, qt, Ut)dWt - ZtdBt, <t <T, 

-dyt = fit, Xt, Zt, yt, qt, Ut)dt + g{t, x*, Zt, yt, qt, Ut)dBt - qtdWt, 0<t<T, 
Vt = V- 

Here the forward equation is "forward" with respect to a standard 
stochastic integral dWt, as well as "backward" with respect to a backward 
stochastic integral dBt; the coupled "backward equation" is "forward" under 
the backward stochastic integral dBt and "backward" under the forward one, 
which generalized the general FBSDEs. In other wards, both the forward 
equation and the backward one are BDSDEs with different directions of 
stochastic integral. Under certain monotonicity conditions, they proved the 
uniqueness and existence theorem for these equations. 

In this paper, we study a stochastic optimal control problem with 
initial-terminal state constraints where the controlled system is described 
by the above time-symmetric forward-backward stochastic differential equa- 
tions. We suppose that the initial state C, and the terminal state r] fall in two 
convex sets respectively and the corresponding states Xrf'^ and i/g 
satisfy the constraints E{ijj{xj^'^ )) = a and E{h{yQ^)) = b respec- 
tively. Then we minimize the following cost function: 

J(e, V, «(■)) = E[[ l{x{t), z{t),y{t),q{t),u{t),t)dt+x{0+Kv)+Hx{T))+l{ym]. 
Jo 

It is well known that the maximum principle is an important approach 
to study optimal control problems. The systematic account on this theory 
can be found in [2] and [22]. When the controlled system under consider- 
ation is assumed to be with state constraints, especially with sample-wise 



constraints, the corresponding stochastic optimal control problems are dif- 
ficult to solve. A sample-wise constraint requires that the state be in a 
given set with probability 1; for example, a nonnegativity constraint on the 
wealth process, i.e., bankruptcy prohibition in financial markets. In order 
to deal with such optimal control problems, an approach named "terminal 
perturbation method" was introduced and applied in financial optimization 
problems recently (see [9-12]). This method is based on the dual method or 
martingale method introduced by Bieleckiet in [3] and El Karoui, Peng and 
Quenez in [6]. It mainly applies Ekeland's variational principle to tackle the 
state constraints and derive a stochastic maximum principle which charac- 
terizes the optimal solution. For other works about the optimization prob- 
lem with state constraints, the readers may refer to [20-21]. In this paper, a 
stochastic maximum principle is obtained for the controlled time-symmetric 
FBSDEs with initial-terminal state constraints by using Ekeland's varia- 
tional principle. Different from [7], our controlled system is time-symmetric 
forward-backward stochastic differential equations and sample-wise initial 
and terminal state constraints are considered. 

We give three specific cases to illustrate the applications of our obtained 
results. In the first case, the controlled system equations are composed of a 
normal Forward SDE and a Backward doubly SDE. We only consider one 
Backward doubly SDE as our system equation in the second case. Finally 
we study the backward doubly stochastic linear-quadratic (LQ) problems. 

This paper is organized as follows. In section 2, applying Ekeland's 
variation principle we obtain a stochastic maximum principle of this con- 
trolled time- symmetric forward-backward stochastic differential equations 
with initial-terminal state constraints. Some applications are given in the 
last section. 

2. The main problem 

§ 2.1 Preliminaries 

Let us first recall the existence and uniqueness results of the backward 
doubly stochastic differential equations (BDSDEs) which was introduced by 
Pardoux and Peng [17], and an extension of the well known Ito's formula 
which would be often used in this paper. 

Let (fi, J^, P) be a probability space, and T > be fixed throughout 
this paper. Let {Wt,0 < t < T} and {Bt,0 < t < T} be two mutually 
independent standard Brownian motion processes, with values respectively 



in W^ and in M} , defined on (^2, J-", P). Let N denote the class of P-null set 
of T. For eacli t G [0, T], we define: Tt = J^V V J^^t. Wliere 

J^w ^ jzw ^ a{W, -VFo;0<r <t}VN, 
jrs = J^B^ = a{Br -Bt;t<r<T}WN. 

Note tliat tlie collection {J^t,t ^ [0)^]} is neither increasing nor de- 
creasing, and it does not constitute a filtration. 

For any Euclidean space H, we denote by < ■, ■ > the scale product of 
H. The Euclidean norm of a vector y G M'^ will be denoted by \y\, and for a 
dx n matrix A, we define \\A\\ = y^Tr{AA*). 

For any n e N, let M'^{0, T; R") denote the set of (classes ofdPxdt) a.e. 
equal) n-dimensional jointly measurable random processes {ft',t G [0,T]} 
which satisfy: 

{i)E J^ \ipt\'^dt < oo; 

(ii)V9( is J^r measurable, for a.e. t G [0,T]. 

We denote by 5*^(0, T; R") the set of continuous n-dimensional random 
processes which satisfy: 

{i)E{ sup Iv^ip) < cx); 

0<t<T 

(ii)(y9i is J-i- measurable, for any t G [0, T]. 
Let 

f ■.nx[0,T]xR^ xR'^^^^R^ 

t/ : fi X [0, T] X R'^ X R'''"^ -^ R'^^', 
be jointly measurable and such that for any (y, g) G R^ x R'^^^, 

f{;y,q) G M2(0,T;R^), 
g{;y,q) G M2(0,T; R'^x'). 

Moreover, we assume that there exist constants C > and < a < 1 
such that for any {u,t) G fi x [0,T], (|/i,gi), (?/2,?2) e R^' x R^''^', 

\fit,yuqi)-f{t,y2,q2)? < C{\y^-y2\' + \\q^-q2\n .^. 

Wgit^yuQi)- 9i't,y2,q2W < Cbi-?/2p + a||gi -g2 IP- 
Given 77 G L^(r2, J-^-, P;R'^), we consider the following Backward doubly 
stochastic differential equation: 

yt = 'n+ f{s,ys,qs)ds+ g{s,ys,qs)dBs- qsdWs,0<t<T. 
Jt Jt Jt 

(2.1) 
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We note that the integral with respect to {-B*} is a "backward Ito 
integral" and the integral with respect to {VFi} is a standard forward Ito in- 
tegral. These two types of integrals are particular cases of the Ito-Skorohod 
integral, see Nualart and Pardoux [15]. 

By Theorem 1.1 in [17], the above equation (2.1) has a unique solution {y, q) G 

Next let us recall an extension of the well known Ito's formula in [17] 
which would be often used in this paper. 

Lemma 2.1. LetaE ^^([0, T]; M^), /? G M2([0, T]; R^), 7 G M2([0, T]; R'^^'), 
S G M2([0,T];R'^^'^) be such that: 

at = ao+ f Ms + / IsdB, + f 6JW,, 0<t<T. 
Jo Jo Jo 

Then, 

|atP = |aoP + 2j^{as,f3s)ds + 2 J^{as,-fsdBs) - JoWlsW^ds + /^ ||(5,|pds, 
E\at\' = E\ao\' + 2Ej^{as,/3s)ds-Ej^\\-fs\\^ds + Ej^\\6s\\'ds. 

More generally, for G C^(R'^), 

<f){at) = 0(ao) + J^{(p'{as),^s)ds + fy'{as).isdBs) + /„ (0'(a.), dsdW^) 
-\ /,* Tr[<P"{as)^sll]ds + \ £ TrW'{as)6s5:]ds. 

§ 2.2 Problem formulation 
Let X be a nonempty convex subset of R"^'^. We set 
^7[0,r] = {u{-)\u{t) eKa.e.,a.s.,0<t<T;u{-) G M\0,T;W'^)}. 
An element of U[0, T] is called an admissible control. Now let 

F : f] X [0, T] X R" X R"^' x R'^ x R'^^"' x R"^'^ -^ R", 
G ■.nx[0,T]xW X R"^' X R'^ X R'^^'^ x R"^'^ -^ R"^^, 

and 

/ : f] X [0, T] X R" X R"^' x R'^ x R'^^'^ x R"^"' -^ R'', 

g ■.nx[0,T]xW X R"^' x R'^ x R'^^"' x R"^"' -^ R^'^^ 



be jointly measurable such that for any {x, z, y, g) G M" x M"^' x 
and any «(•) G f/[0,T] 

F{;x,z,y,q,u{-)) G M^{0,T;R^), G{;X,z,y,q,u{-)) G M2(0,T;M"x^) 
f{-,x,z,y,q,u{-)) G M\0,T;R'^), g{-,x, z,y,q,ui-)) G M2(0, T; M^x'), 

Let 

C(t) = (x(t),z(t),i/(t),5(t)), 

A(t,C) = {-F,-G,-f,-g){t,C). 

We assume 

(HI) VCi = ixi,zi,yi,q,), C2 = (^2, ^2,1/2, ^2) G R" x M"x' x M'^ x 
jjfexd g^j^^ ^ ^ [0,7"], there exists a constant /x > such that the following 
monotonicity condition holds for any u{-) G f/[0,T] 

< A{t, Ci) - A{t, C2), Ci - C2 >< -yt^lCi - C2I'. 

(H2) There exist constants C > and < a < | such that for any 
{u,t) Gfix [0,T], {xi,Zi,yi,qi,Ui),{x2,Z2,y2,q2,U2) G M'^ x M^^d ^ ^n ^ 
j^nxi ^ ]^"-x<^ the following conditions hold: 

|?9(t, xi, 2;i, 1/1, gi, ui) - 'd(t, X2, Z2, yi, qi,U2)\'^ 

< C{\xi -X2P + 11^1 -2;2|P + \yi -1/2P + Iki -g2|P + 11^1 -^i2|P), 
||x(t,a;i,2;i,2/i,gi,Mi) - x(t,a;2, 2^2, 2/i, gi,M2)|P 

< C(|a;i -a;2p + |?/i-?/2p + Iki - M2in + a(||2;i -^2!^ + Iki-?2in, 

where i? = {F, f), h = {G,g). 

{B.3)F,G, f,g,ip,hj,x, ^,4> and 7 are continuous in their arguments 
and continuously differentiable in (x, z, y, q, u), and the derivatives of F, G, f, g 
in {x,z,y,q,u) are bounded and < ||g'z(-)ll < 2' '-' ^ ll^zlOII < I' 
< IbglOII < 2' '-' ^ ll^9(")ll < 2' ^'^^ derivatives of / in {x,y, z,q,u) 
are bounded by C(l + |a;| + +\z\ + \y\ + \q\ + \\u\\), and the derivatives of (/), 
X and i/j in X are bounded by C(l + \x\) ; 7, A and h in y are bounded by 
C{l + \y\). 

Given ^ G ^^(fi, J-^, P;M™), r] G ^^(fi, jr^, P;M'=) and Vn(-) G [/[0,T] 
now let us consider the following time-symmetric forward-backward differ- 



ential equations: 

-dxt = F{t,Xt,Zt,yt,qt,Ut)dt + G{t,Xt,Zt,yt,qt,Ut)dWt-ZtdBt, 

xo = C, 
-dyt = f{t,Xt,Zt,yt,qt,Ut)dt + g{t,Xt,Zt,yt,qt,Ut)dBt-qtdWt, 

Vt = V- 

(2.2) 
From theorem 2.2 in [18], we get the following theorem: 

Theorem 2.2. For given ^ G L2(fi, J'q, P;K"), r] G ^^(fi, J-^, P;M'^) and 
V'u(-) G U[0,T], assume (Hl)r^(H3), then (2.2) exist a unique J^t-adapted 
solution {x(t), z(t),y{t),q{t)). 

In (2.2), we regard ^, r], u{-) as controls. ^, r], tt(-) can be chosen from 
the following admissible set : 

U = {{C,r],u{-))\^ eKiC Wa.s., r] e K2 C R'^a.s., E[\C\^] < 00, E[\r]\^] < 00, «(■) G U[0,T]}. 

We also assume the state constraints E{tp{x^'^ )) = a and E(h{yQ^)) = 
b. 

For each {^,ri,u{-)) G U, consider the following cost function: 

= E[/o" l{x{t), z{t),y{t),q{t),u{t),t)dt + xiO + Kv) + <l^{x{T)) (2.3) 
+7(1/(0))]. 
Our optimization problem is: 

(«,J))6C/ '^'"^'''' (^2 4) 

subject to £;(V'(xf '''•"^■^^)) = a- E(/i(|/J«''''""^)) = h. 

Definition 2.3. A triple of random variable {^,ri,u{-)) E U is called feasible 
for given a G W, b E M.^ if the solution (2.2) satisfy E{i/j{x>^"^'^^''')) = a 
and £'(/i(?/q )) = b. We shall denote by N{a,b) the set of all feasible 

{^, r], u{-)) for any given a and b. 

A feasible {^* , t]* , u* {■)) is called optimal if it attains the maximum of 
J{^,ri,u{-)) over N(a, 6). 

The aim of this paper is to obtain a characterization of {^* , rj* , u* {■)) , 
i.e. a stochastic maximum principle. 



§ 2.3 Stochastic Maximum Principle 

Using Ekeland's variational principle, we derive maximum principle for 
the optimization problem (2.4) in this section. For simplicity, we first study 
the case where l{y{t) , z{t) , y(t) , q(t) , u{t) ,t) = 0, x(a;) = and X{y) = in 
subsection 2.3.1-2.3.3, and then present the results for the general case in 
subsection 2.3.4. 

2.3.1 Variational equations 
For ((^^, 'r]^,u^{-)), ((^^, t]'^, u'^{-)) G U, we define a metric in U by 

It is obvious that {U, d{-, ■)) is a complete metric space. 

Let {^* , T)* , u* {■)) be optimal and {x*{-),z *{-),y*{-),q *(■)) be the cor- 
responding state processes of (2.2). V((^, r], «(■)) G U and VO < p < 1, 

(r + p(e - n,v* + piv - ^*), «*(■) + pH-) - u*{-))) e u. 

Let {x^{-),z ^{■),y^{-),q ''(•)) be the state processes of (2.2) associated 

with (e + p(e - e),v* + Piv - v*),u*i-) + pH-) - «*(■)))• 

To derive the first-order necessary condition, we let (£(■), z (■), y{-), q{-)) 
be the solution of the following time-symmetric forward-backward differen- 
tial equations: 



/ 



(2.5) 



-dxt = {F:{t)xt + F:{t)zt + F;{t)yt + F;{t)qt + F:{t){u{-) 
-u*{-)))dt + {Gl{t)xt + GlUYzt + Gl{t)yt + Gl{t)qt 
+Gl{t){u{-) - u*{-)))dWt - ZtdB{t), 

-dyt = u:{t)h + f:{t)zt + f;{t)yt + f;{t)qt + m)H-) 

-u*{-)))dt + {gl{t)xt + glityzt + gl{t)yt + gl{t)qt 
+gl(t){u{-)-u*{-)))dB,-q,dW{t), 

Vt = V-V\ 

where iJ,*(t) = Hk{x*{-),z *{-),y*{-),q *{-),u*{-),t) for H = F,G,f,g,k 
x,z,y,q,u. 

Equation (2.5) is called the variation equation. 



Set 



Xp(t) = p ^[xp{t) - x*{t)] - x{t), 
~Zpit)=p-'[zpit)-z*{t)]-zit), 

yp{t) = p''[ypit)-y*m-m, (2.6) 



We have the following convergence. 

Lemma 2.4. Assuming (HI) ~ {H3) we have 

lim sup E[\xp{t)\'^] = 0, 
^raE[J^\\zpit)\M = 0, 

p—>0 

hm sup E[\yp{t)\^] = 0, 
\\mE[j^\\qp{t)\\'dt] = (). 



(2.7) 



Proof. From (2.2) and (2.5), we have 

-dyp{t) = p~'[f{xp{t), Zp{t),yp{t),qp{t),Up{t),t) - f {x* (t) , z* (t) , y* (t) , u* (t) , q*{t),t) 

-pfmrn - pn{t)z{t) - pf;imt) - pf*{t)m - pf:mu{.) - u*{-))]dt 

+p-'[g{xp{t),Zp{t),yp{t), qp{t),Up{t),t) - g{x*{t), z* (t) , y* (t) , u* {t) , q* (t) , t) 
-pg:{t)x{t) - pg:{t)z{t) - pgl{t)m - pgl{t)q{t) - pgl{t){u{-) - u*{-))]dBt 
-qp{t)dWu 

UT) = 0. 

(2.8) 
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Let 

A'{t) = J^ Ux*{t) + \p{x{t) + £,(t)), z*{t) + \p{z{t) + 5p(t)), ?/*(t) + \p{y{t) + yp(t)), 

g*(t) + \p{q{t) + gp(t)), M*(t) + \p{u{t) - u*{t)),t)d\ 
B'{t) = J^ Mx*{t) + \p{x{t) + i.it)), z*{t) + Xp{z{t) + Zp{t)),y*{t) + Xp{y{t) + ^^(t)), 

q*{t) + Ap(g(t) + gp(t)), M*(t) + Xp{u{t) - u*it)),t)dX, 
C'\t) = J^ fy{x*{t) + Xp{x{t) + i,{t)), z*{t) + Xp{z{t) + ~z,{t)),y*{t) + Ap(y(t) + y,{t)), 

q*{t) + Ap(g(t) + qp{t)),u*{t) + Ap(M(t) - M*(t)), t)dA, 
D'{t) = J^ f,{x*{t) + Xp{x{t) + £,(t)), 2;*(t) + Xp{z{t) + 5p(t)), 2/*(t) + Ap(y(t) + y,{t)), 

q*{t) + Xp{q{t) + gp(t)), M*(t) + Xp{u{t) - u*{t)),t)dX, 

E^{t) = [A^it) - mrnit) + [B^it) - f:it)]m + [c^(t) - f;it)]m + [u^it) - f*it)]m 

+ Io[fu{x*ii) + MHt) + 5p(t)), z*(t) + Xp{z{t) + Zp{t)),y*{t) + Xp{y{t) + y^(t)), 
g*(t) + Ap(g(t) + gp(t)), w*(t) + Ap(tx(t) - u*{t)),t) - f:{t)]{u{t) - u*{t))dX 

and 

Alit) = f^ g.{x*{t) + Ap(x(t) + x,{t)),z*{t) + Xp{z{t) + 5p(t)), ^W + Ap(y(t) + y,{t)), 

q*{t) + Ap(g(t) + qp{t)),u*{t) + Ap(ti(t) - u*{t)),t)dX, 
B{{t) = /o' 9z{x*{t) + Ap(x(t) + £p(t)), 2*(t) + Xp{z{t) + 5p(t)), r (t) + Ap(y(t) + y,(t)), 

g*(t) + Ap(g(t) + gp(t)), M*(t) + Ap(ti(t) - u*{t)),t)dX, 
Cl{t) = jl gy{x*{t) + Ap(x(t) + Xp{t)), z*{t) + Xp{z{t) + 5p(t)), ?/*(t) + Xp{y{t) + ^^(t)), 

5*(t) + Ap(g(t) + qp{t)),u*{t) + Xp{u{t) - u*{t)),t)dX, 
Dl{t) = jl 9,{x*{t) + Ap(x(t) + Xp{t)),z*{t) + Xp{z{t) + 5p(t)), r (t) + Ap(y(t) + yp(t)), 

g*(t) + Ap(g(t) + qp{t)),u*{t) + Xp{u{t) - u*{t)),t)dX, 
E[{t) = [A?(t) - gm]x{t) + [Sf (t) - g:{t)]z{t) + [Cf (t) - ^;(t)]y(t) + [Df (t) - ^*(t)]g(t) 

+ J^[gu{x*{t) + Ap(x(t) + x,{t)),z*{t) + Ap(i(t) + 5p(t)), r (t) + Ap(y(t) + y,{t)), 

q*{t) + Ap(g(t) + q,{t)),u*{t) + Xp{u{t) - u*it)),t) - g:{t)] (tx(t) - u*it))dX. 

Thus 

-rfy,(t) = (A''(t)£,(t) + S''(t)5,(t) + CPit)ypit) + D'^{t)qp{t) + ^''(t)) rft 

+ (A?(t)5,(t) + 5f (t)5,(t) + Cf (t)y,(t) + D^(t)5~,(t) + E'l{t)) dBt 
-q,{t)dW{t), 
yp{T) = 0. 



11 

Using lemma 2.1 to |yp(t)p, we get 

E\ut)\' + E j;" msWds 

= 2Ej^{y,{t), AP{s)Sp{s) + BP{syz,{s) + CP{s)y,{s) + DP{s)q,{s) + EP{s))ds 

where K>0,0<f3i<-^ are constants and 

Jp = Ej^\EP{s)\^ds + Ej^\Ef{s)\^ds. 
Similar analysis shows that 
' -dxp{t) = {A'P{t)S:p{t) + B'P{t)zp{t) + C'f{t)yp{t) + D'P{t)qp{t) + E'P{t)) dt 

+ (Ant)£,(t) + B'rityz.it) + cnt)ut) + ^no^pio + E'nt)) dwt 

-~Zp{t)dB{t), 
5p(0) = 

where H'^it) and H'{{t) {H = A, B, C, D and E) are similarly defined as 
above. 

It yields that 

E\~x,{t)\'' + E jf \\zp{s)\\^ds 
= 2E^^{i,{t),A'P{srXp{s) + B"^{s)zp{s) + C'P{s)y,{s) + D'f{s)qp{ts) + E'P{s))ds 
+EJ^\\A:({s)~x,{s) + Bns)z,{s) + Cr{s)y,{s) + Dr(5)?~p(5) + E'^sWds 

< K(i?/J|y,(s)|2rfs + i?/J|£,(s)|2rfs)+/?2(i?/f ||5,(s)||2rfs + E/f ||Q~,(s)||2rfs + j;) 

where /ir>0,0</32<| are constants and 

j; = C{Ej^\E'f\s)\'ds + Ej^\E'r{s)Hs). 
Since < /?i + /?2 < 1 and -ft' > 0, there exists Ki> Q such that 

E\~xM'' + E\w{t)? + {^-Pi-mE!f\\~z,{sWds + EJ^\\q,{s)\\Hs) 

< Ki{E /f |£p(s)|2rfs) + E /f |yp(s)|2rfs + Jp + j;). 

Since the lebesgue dominated convergence theorem implies 



lim Jo = 0, lim J' = 
we obtain the result by Gronwall's inequality. D 
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2.3.2 Variational inequality 

In this subsection, we apply Ekeland's variational principle [4] to deal 
with initial-terminal state constraints E{iIj{x^''^ )) = a, £'(/i(|/q )) = 

b. 

+ (max(0, E[7(?/f '''*'"*^-») - 7tf •'''""^)] + £))n^ 

(2.9) 
where a and b are the given initial and terminal state constraints and e is 
an arbitrary positive constant. 

It is easy to check that the mapping [^'(■^/'(x^?''''" ))— op, |i?(/i(yQ )) — 

6p, (j){xrj:'^ ) and 7(?/o " ) ^-re all continuous functionals from U to M. 

Theorem 2.5. Suppose (Hl)r^(H3). Let (^* , rj* , u* (■)) be an optimal so- 
lution to (2.4)- Then there exist /iq, /^i, /i2) ^3 ^ ^ with ho, hi < and 
\ho\ + 1^1 1 + 1^2! + I ^3 1 7^ such that the following variational inequality 
holds 

hE < ^.(4*'"* '«*(■«), XT > +hE < /.,(?/f '"^'-'^^yo > 

+h < i?0.(xf '''*'^*(-»),i;T + /^oi57.(l/f '''^•"'^■^^yo > (2.10) 

>0 

where Xt is the solution x{t) of (2.5) at time T, and jjo is the solution y{t) 
of (2. 5) at time 0. 

Proof. It is easy to check that F^{-) is continuous on U such that 

F,{C.r]\u*{-))=e- 
F,(e,r7,w(-))>0, V(e, 77, «(■))€ f/; 
F,{i\r^\u*{-))< inf F,{i,r^M-)) + e. 

Thus, from Ekeland's variational principle [4], 3{^'^,ri'^,u'^{-)) G U such 
that 

(ii)d((e,r7*, «*(■)), (e,^^ «'(■))) <v^; 

(iii) F,(e, V, «(■)) + V^diiC, 7], ui-)), (e, r,^, u%-))) > F,(e, r/^ u%-)). V(e, V, O) e U. 
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For any {^,r],u{-)) E U, set (^^ = ^ + p(^ - C),Vp = V"" + piv - 
V')X,{-) = u^{-)+p{u{-)-u^{-)),Q <p<l. Let (x^(-),?/^(-),^^(-),9^(-)) 
(resp. (x'^(-), |/^(-), 2;^(-), g^(-))) be the solution of (2.2) under (^pj^^p, w^(-)) 
(resp.(^^, r/"^, m'^(-)), and (x'^(-),^'^(-), 5'^(-), g'^(-)) be the solution of (2.5) in 
which (^*, 7]*, «*(•)) is substituted by {^'^ , rj'^ , u"^ {■)) . 

From (iii), we know that 

Feie,, v;, u%-)) - F,(f , r^^ u^{.)) + ^ed{{e,: v;, «p(-)), (f , r]^ tx^(-))) > o. 

(2.11) 
On the other hand, similarly to lemma 2.4 we have 

lim sup E[p-^[xl{t) - x'{t)] - x'{t)]'^ = 0, 
hm sup E[p-'[y;it) - /(t)] - ^(i)]' = 0. 

P-^O 0<t<T 

This leads to the following expansions 

|£(^(x^^-''-"^"^)) - a|2 - |£;(^(xf '''^'"^(•)))) - ap 
= < 2p£;(^(xf ''''•"'"^)) - a, £;[V^,(xf '"''"'")) X (x^)] > +o(p), 

|E(%p>^(-»)) - &p - |i?(Myp^'"^"^)) - fep 

= < 2pi?(Myf '''^•"^(■»)) - 6, i?[/.,(yf '^^'-^(■») X (yg)] > +o(p). 
Applying the linearization technique, then 

E[0(xJ-''-"^"V0(4^^'"^'"^^-^^)] = i?[/o'0.(4^^'''^~^'''^"'''"^^'"^"^''''^VA-px^]; 

So we have the following expansions 

(i?[0(xf '''*'"*(■») - 0(0;$!-''-"^"^)] + ey - (^[0(4*'"* '"*(■») - 0(xf •''^'"^(■»)] + £)2 
= -2p£;[0,(x5?''''' '"'")) • px^][E[0(xf •''*•"*")) - 0(x5?''''''"'"^)] + e], 

(Ebiyf-*^-*^-^^) - 7(l/i'^"^'"^"^)] + ef - (i^bd/f'"*'"*"^) - 7(yr'^^'"^"^)] + e)^ 
= -2p£;[7,(2/f '''^•"^(•») ■ pyg][i?[7(yf •''*'"*^-^^) - 7(1/?^'"^'"^"^)] + e]. 

For the given e, we consider the following four cases: 

Case 1. 

There exists po > such that 

i?[0(xp>^(-)^)-0(4^'''^'"^(-«)]+e>O 
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and 



Emvo' )- 7(2/0 ''')] + ^>o 



for all p G (0,po)- 
In this case, 



p-5>0 ^ 



f'l«g.qg,"^(■))-^F|(€^q^^"(■)) 



Dividing (2.11) by p and sending p to 0, we obtain 
< /i|,i?[V'.(x«''''''"'")) X (x^)] > + < /i|,i?[/i,(yf '"''"'")) X (y§)] > 

> -v^rf((e,r]^n-(■)),(e,^,^(■)))• 

It can be written as 

hlE < V^.(x«' '"''"'(■«), x|, > +hlE < /.,(2/f '"''"'(■«), ^g > 



where 



^0 = - ^^(^.^^t,,.(.)) [^[7(.r--*'"^-^^) - 7(.r^''^'"^-^^)] + .] < 0, 

/^i = -7T77r^^7TT[^[</'(4*'''*'"*^-^^) - 0(xr'''^'"^^-^^)] + £] < 0, 



1 



/^3 = 

Case 


i^e(e. 


1 


u^i-)) 


i2. 


,v', 


u^i-)) 



[^(^(x^«''''''"'")))-o], 
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There exists a position sequence {pn} satisfying pn — ^ such that 



Then 



+ (max(0, ^[7(i/f '''*'"*^-^^) - ^(^(«?"'''?"'"^"(-»)] + £))2}i 
For sufficiently large n, since F£(-) is continuous, we conclude 

+ (max(0, i?[7(l/f '''*'"*"^) - 7(yf •''^'"^"^)] + e))^^- 
Now 



71— >00 



Pn 



_ n^ 1 ^i'«g„■'?k^"pn(^)))-^l(g^'?^"^(^)) 

Similar to case 1 we get 
where 

K = - fi(e.,\t.-(.)) '^'^'^°''"''"'"'' " i'<»»''"''""'"'i + -'1 ^ »• 

fc'i = 0, 
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Case 3. 

There exists a positive sequence {/}„} satisfying p„ — )■ such that 

Case 4. 

There exists a positive sequence {p„} satisfying p„ — )■ such that 

£;[<^(4^-'''-'"-^-^^)-<^(xf •"*'"*(■)))] +e < 0, 
^[7(?/f '"'•"'^■^^ - 7(2/f "•''^"•"^""^)] + e < 0. 

Similar techniques can be used to both case 3 and case 4. 

In summary, for all those cases, we have h^ < 0,h\ < and |/iqP + 
l^iP+ l^2p+ mP ~ l t'y the definition of F^{-). Then there exists a conver- 
gent subsequence of (/i|, h^, h\, h^) whose limit is denoted by (/13, /12, hi, h^). 
On the other hand, it is easy to check that Xq — > , i/q — ?■ as e — ?■ . Thus 
(2.10) holds. D 

2.3.3 Maximum principle 

In this subsection we derive the maximum principle for the case where 
/(x, z, y, z, t) = 0, ({x) = 0, X{y) = and then present the results for the gen- 
eral case in subsection 2.3.4. To this end, we introduce the adjoint process 
(m(-),p(-)) and (n(-),(5(-)) associated with the optimal solution {C^* , t]* , u* {■)) 
to (2.2), which is the solution of the following time-symmetric forward- 
backward stochastic differential equations 



-dm{t) = -{F:{t)m{t) + Gl{t)p{t) + f:{t)n{t) + gl{t)6{t))dt 

HF:it)m{t) + Gl{t)p{t) + f:{t)n{t) + gl{t)5{t))dB, 

-PtdWu 

m{T) = -(/i3V^^(x^) + /ii0^(x^)), 

dn{t) = {F;{t)m{t) + G;it)pit) + f;it)nit)+g;it)5it))dt 

+ (F;(t)m(t) + Glit)pit) + /;(t)n(t) + g;it)6it))dWt 

-StdBt, 

n{0) = {h2hy{y*) + ho-fyiy*o), 

(2.12) 
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where F^{t), f^{t), Gl{t),gl.{t) for k = x,y,z,q are defined as in (2.5). It is 
easy to check that there exist unique processes (m(-),p(-)), (n(-), 6{-)) which 
solve the above equations. 

Theorem 2.6. We assume (Hl)r^(H4). Let {^* , rj* , u* (t)) be optimal and 
{x*{-),z *{-),y*{-),q *(■)) be the corresponding optimal trajectory. Then for 
arbitrary {^,ri,u(t)) E U we have 



<m(0),e-e)><0, 

<n{T),r]-ri* >>0, (2.13) 

< mj:it)+p{t)g:{t) + ntF:{t) + G:{t)5it),u{t) - u*it) >> 0. 

Proof. For any (^,?7,'u(-)) G U, let (x(-),5 (■),y(-), ^(■)) be the solution to 
(2.5). Applying Lemma 2.1 to < m(t),x(t) > + < n(t),y(t) > , we have 

E < m{T),x{T) > +E < n{T),y{T) > -E < m(0),x(0) > -^ < n{0),y{0) > 
= [E j;^ mt{-F:{t)xt - F:{t)zt - F;{t)yt - F*{t)qt - F:{t)ut)dt 
+E J^ Xt{F:{t)m{t) + Gl{t)p{t) + f;{t)n{t) + g*{t)m)dt 
+E J^ Zt{F:{t)m{t) + Gl{t)p{t) + f:{t)n{t) + g:{t)6{t))dt 
-E j^ p{1:){Gl{t)xt + Gl{t)zt + Gl{t)yt + G*(t)g, + Gl{t)ut)dt 
-E j^ n,{fl{t)x, + f:{t)z, + f;{t)y, + f;{t)q, + f:{t)u,)dt 
+E j^ UF;{t)m{t) + Gl{t)p{t) + f;{t)n{t) + g;{t)5{t))dt 
+E J^ q,{F*{t)m{t) + G*(t)p(t) + f;{t)n, + g*(t)5,)rf^ 
-E/Q^(5(t)((7:(t)xt + g:{t)zt + g;{t)yt + (7*(t)5i + gl{t)ut)dt 
= -Ej^ < mtF:{t)+p{t)G:{t)+ntf:{t)+g:{t)6{t),ut > dt. 



This yields 

E < n{T),y{T) > -E < m(0),x(0) > 

+Ef^ < mtF:{t)+p{t)Gl{t) + ntflit) + 6(t)gl{t),u{t) - n*{t) > dt 
= -E < m{T),x{T) > +E < n{0),y{0) > 
= h,E < ^.(4*'"* •"*(■»), a:^ > +h,E < /.,(?/f '"''"* "^ yo > 

+h,E (0.(4*'''* '"*(■»), xt) + hoE (7.(?/f •''*'"*"^ yol 
> 
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V(^, ??,«(■)) e U, we get 

E < n{T),r] - r]* > -E < m(0), ^ - C) > 

+EJ^ < m,F:{t) + p{t)Gl{t) + n,/:(t) + 5{t)gl{t)Mt) " n*{t) > dt 
> 0. 

Thus, it is easy to see that (2.13) holds. D 

2.3.4 The general case 
Define the Hamiltonian 

H -.W X M"^^ X R'^ X R^'^"' X R"^'^ X R" X R"^' x R'^ x R'^^^ x [0, T]^R 

by 

H{x,z,y,q,u,m,p,n,5,t) 
= < F(t,x,z,y,q,u),mt > +G{t,x, z,y,q,u),p{t) > 
< fit, X, z, y, q, u),n{t) > + < g{t, x, z, y, q, u),S{t) > 
+l{t,x,z,y,q,u). 

Now we consider the general case where l{x{t),z{t),t) ^ 0,x(a^) 7^ 
0,A(2/)^0. 

Since the proof of the maximum principle is essentially similar as in 
the preceding subsection, we only present the result without proof. 

Let (r,r/*,M*(-)) be optimal to (2.4) with {x\-),y\-),z *{-),q *(■)) be- 
ing the corresponding optimal trajectory of (2.2). We define the following 
adjoint equations 

' -dm{t) = -(F:{t)m{t) + Gl{t)p{t) + f;it)n{t)+g:{t)S{t) + ll{t))dt 

+ {F:{t)m{t) + Gl{t)p{t) + /:(t)n(t) + g:{t)5{t) + i:{t))dBt - ptdWt, 
m{T) = -(/i3^^(x^) + /ii0^(x^)), 

dnit) = (F;it)mit) + G;it)pit) + f;it)nit) + g;imt) + i;it))dt 

+(F;(t)m(t) + G;{t)p{t) + f;{t)n{t) + glimt) + ll{t))dWt - StdBt, 
n(0) = {h2hy{y*) + ho-fy{y*o), 

where /^(t) = l{t, x* {t) , z* {t) , y* {t) , q*{t)) and i;{t) = h{t, x*{t), z*{t),y*{t), q*{t)). 
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Theorem 2.7. We assume (Hl)r^(H3). Let {^* , rj* , u* {■)) be optimal and 
{x*{-),y*{-), z *{-),q *(■)) be the corresponding optimal trajectory. Then for 
arbitrary {^,ri,u{-)) G U, we have the following inequalities hold 

<Mo)-xx(e),e-e)><o, 

<n{T) + Xy{rj*),r]-ri* >> 0, 

< Hu{t, X* , z* , y* , q* , u* , m, p, n, 6) , u{t) - u*{t) >> 0. 

Remark: Let us denoted the boundary of Ki by dKi. Set 

M ^{we n\C{uj) e dKi}. 

Then 

' m(0) - XxiC) < a.s. on M, 

^(0) - XxiO = a.s. on M"" 
Similar analysis can be used to the boundaries of K2 and K. 

3. Applications 

In this section, we give three specific cases to illustrate the applications 
of our obtained results. 

§ 3.1 Systems composed of a Forward SDE and a BDSDE 

Classical formulation For given ^ G L^{n, Jq, P;K") and u{-) G U[0, T], 

we consider the following controlled system composed of a Forward SDE and 
a Backward doubly SDE. 

dyt = b{t,yt,ut)dt + a{t,yt,ut)dW{t), 

^° = ^1 (3.1) 

dxt = f{t,xt,zt,ytut)dt + g{t,Xt,Zt,ytUt)dWt- ZtdB, 

xo = ^, 

where b & R'' is given, Xq = ^ & Ki,a.s where Ki is a given nonempty 
convex subset in i?". 
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Set the mappings 



b 


Qx [0,T] xR^ xM"^"'^/?^ 


a 


n X [0, T]xR'' X W""^ -^ R^""^, 


f 


Vt X [0, T] X M" X R"^' X R'^ X R"^'^ -^ 


9 


VL X [0, T] X R" X R'^^' X R'^ X R"^'^ -^ 



pnxd 



In this case, we regard u{-) and ^ as the control variables. Define the 
following cost function: 

J(e, ui-)) = E[[ lit, x{t), zit),yit),u{t))dt + x(0 + KviT)) + 0(x(T))], 



where 



X 
A 



fi X [0,T] X 
R" ^ R, 
R'' ^ R, 
R" ^ R. 



r X R"^' X R^' X R"^'^ -^ R, 



We assume: 

(Hl):6, 0", /, g, I, x, A and are continuous in their arguments and con- 
tinuously differentiable in {x,z,y,u); 
(H2) 

\f{t,xi,zi,yi,qi,ui) - {t,X2,Z2,yuqi,U2)\^ 

< C{\xi -X2p + 11^1 -^2|P + \yi -i/2p + ||gi -g2|P + Iki -M2in, 

and 

||^(t,a;i,2;i,2/i,gi,Mi) - /(t,a;2, 2^2, ?/i, gi,M2)|P 
< C(|a;i -X2p + I2/1 -?/2p + ||mi -M2in + a(|ki --22!^ + \\qi -q2\\'^)- 

(H3) The derivatives of b, a, f, g in (x, y, 2;, u) are bounded; the deriva- 
tives of I in {x,y,z,u) are bounded by C(l + |x| + \y\ + \z\ + \\u\\); the 
derivatives of x and in x are bounded by C(l + |a;|); the derivatives of A 
in y are bounded by C(l -|- \y\). 

Then, for given ^ G L^jfi, J'q, P;^") and m(-) G f/[0,T], there exists a 
unique triple 



{x{-),y{-),z{-)) G M=^(0,T;i?") x M^{0,T;R'') x M^{0,T;R 



dxk\ 
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which solves (3.1). 

We assume an additional terminal state constraint i/t = f] & K2, a.s., 
where K2 is a given nonempty convex subset in R''. Our stochastic control 
problem is 

inf ACK)) (3.2) 

subject to u{-) G U[0, T]; ^ E Ki, a.s, rj G K2, a.s. 

Backward formulation From now on, we give an equivalent backward 
formulation of the above stochastic optimal problem (3.2). To do so we 
need an additional assumption: 

(H4) there exists a > such that |o"(|/, ui, t) — a{y, U2, t)\ > a\ui — ^2! 
for all y E R'',te [0, T] and Ui, U2 G i?'^''". 

Note (HI) and (H4) imply the mapping 

u^ (r{y,u,t) 

is a bijection from R'^^"' on to itself for any {y,t). 
Let q = (j{y,u,t) and denote the inverse function hj u = a{y,q,t). 
Then system (3.1) can be rewritten as 

-dyt = f{yt,qt,t)dt-q{t)dW{t), 

yo = b, 

-dxt = F{t,Xt,Zt,yt,qt)dt + G{t,Xt,Zt,yt,qt)dWt- ZtdBt, 

xo = ?, 

where /(y, g, t) = -b{y,a{y,q,t),t) and F{xt, Zt,yt,qt-t) = f{xt,Zt,yta{y,q,t).t), 
G{xt, zt, yt,qt-t) = gixt, zu ytp{y, g, t).t). 

A key observation that inspires our approach of solving problem (3.2) is 
that, since u — ?■ o(x, u, t) is a bijection, q{-) can be regarded as the control; 
moreover, by the BSDE theory selecting q{-) is equivalent to selecting the 
terminal value yr- Hence we introduce the following "controlled" system: 

-dyt = f{t,yt,qt)dt-q{t)dW{t), 

y^ = ^' (3.3) 

-dxt = F{t,xt,zt,yt^qt)dt + G{t,xt,zt,yt,qt)dWt- ZtdBt, 
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where the control variables are the random variables ^ and rj to be chosen 
from the following set 

U = {{C,vM^Ki,a.s.E\^\^ < oo, r]eK2,a.s.E\r]\^ < oo.}. 

For each (^, rj) G U, consider the following cost 

Jo 

where l{x, z, y, q, t) = l{x, z, ya{y, g, t),t). 

This gives rise to the following auxiliary optimization problem: 

inf J(^, 1]) 
(C,'?)6C/ (^3,4) 

subject to ((^, ?7) G [/; i/o = ^• 

where i/q is the solution of (3.3) at time under ^ and i]. 

It is clear that the original problem (3.2) is equivalent to the auxiliary 
one (3.4). 

Hence, hereafter we focus ourselves on solving (3.4). The advantage 
of doing this is that, since ^ and r] now are the control variable, the state 
constraint in (3.2) becomes a control constraint in (3.4), whereas it is well 
known in control theory that a control constraint is much easier to deal with 
than a state constraint. There is, nonetheless, a cost of doing so is that the 
original initial condition yg = b now becomes a constraint, as shown in 
(3.4). 

From now on, we denote the solution of (3.3) by {x^^'^\-), y^^'^\-), q^^'^\-), z^^'^\-)), 
whenever necessary, to show the dependence on {C,,f])- We also denote 
2;(?.^)(^0) and y^^'^\0) by Xq and 2/o respectively. Finally, it is easy to 
check that /, F, G and / satisfy similar conditions in Assumptions (HI) — (H3). 

We note that this is an special case of 2.4, so by the same method we 
have the following result: 



(3.5) 
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Stochastic Maximum Principle We define 

dnit) = {F;{t)m{t)+Gl{t)p{t) + f;{t)n{t) + i;{t))dt 

HF;it)m{t) + Gl{t)p{t) + f;{t)n{t) + /*(t))rfm, 
n(0) = /i2, 
-dm{t) = -{F:{t)m{t) + Gl{t)p{t)+i:{t))dt 

+ (F;(t)m(t) + G:{t)p{t) + /:(t))rfSi - pirfl^i, 
m(T) = -/ii0^(xy), 

where i7,*(t) = Hk{t,x*{t), z*it),y*it),q*it)) {H = F,GJ) and /^(t) = 
fx{t,x*(t),z*(t)), f*{t) = fq{t,x*{t),z*{t)) , hi and /i2 are defined as in 
(2.10). It is easy to check that there exist unique processes n(-) and 
{m{-),p{t)) which solve the above equations. 

Theorem 3.8. We assume (H1)^(H4)- Let {C^V*) be optimal to (3.4) and 
{x* {■) , y* {■) , z *{-),q *(■)) be the corresponding optimal trajectory. Then for 
arbitrary {C,, rj) G U, we have the following inequalities hold 

<miO)-xAe),^-e) < 0, 
<n{T) + Xy{r]*),r]-r]* > > 0. 

§ 3.2 System composed of a BDSDE with state constraints 

Although this case describes a controlled BDSDE system with state 
constraints. But it seems trivial. Thus, we only give a brief illustra- 
tion. Given rj G L'^{Q,J-'t,P',^'') and «(■) G U[0,T], consider the following 
BDSDE. 

-dyt = f{t,yt,qt,ut)dt + g{t,yt,qt,Ut)dBt-qtdWt,0 <t <T, 

Vt = V- 

(3.7) 
For given u{-) G [/[0,T] and f,g satisfying (H2) and (H3), from the 
theorem 1.1 in [17], it is easy to check that there exists a unique solution 
(y(-),?(-))of(3.7).^ 

Note that y^ is a J-(fj.-measurable variable, and 

E{h{y^r^-^^)) = b. 



24 



Now we regard rj and u{-) as the control variables to be chosen from 
the following set 

U = {{V,K))\V e K^,a.s.,EM^] < oo,u{-) e U[0,T]}. 
For each {rj,u{-)) G U, consider the following cost function 

J(77, n(-)) = E[[ liyit),qit),uit),t)dt + \ir]) + 7(^(0))], 

which gives rise to the following optimization problem 

inf J{ri,u{-)) 
subject to £;(/i(2/i'''"^-^))) = b. 

Maximum principle Let 

H{y, q, u, n, 6) =< f{t, y,q,u),nt> + < g{t, y, g, u),S{t) > +l(t, y, q, u). 
Then the adjoint equation is 

dn{t) = {f,;{t)nit)+g;it)6{t) + i;it))dt-5it)dBt + if;it)nit) 

+g;(t)5it) + llit))dWt, 

n(0) = {h,hy{y*iO)) + ho^y{y*m, 

(3.9) 
where iJ,*(t) = Hk{t,y*{t),q*{t)) ioi H = f,g ; k = y,q. 

Theorem 3.9. We assume (Hl)r^(H3). Let rj* and u*{t) be the optimal 
controls and {y* {■) , z* {■)) be the corresponding optimal trajectory. Then we 
have 

<n{T) + Xy{r]*),rj-rj*> > 0, 

<Hu{t,y*,q*,u*,n,6),u{t)-u*{t)> > 0. 

3.3 Backward doubly stochastic LQ problem without state 

constraints 

Consider the following linear system: 
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/ 



-dxt = [A{t)xt + B{t)zt + C{t)yt + D{t)qt + E{t)ut]dt 

+ [A'{t)xt + B'{t)zt + C'it)yt + D'{t)qt + E'{t)ut]dWt 
-ZtdBt, 

+ [A"'{t)xt + S"'(t)z, + C""(t)y, + D"'{t)qt + i5'"(t)ii,]dfi, 

vt = y, 

(3.11) 
where x and y are given constants and H, H', H", H'" {H = A, B, C, D 
and E) are corresponding matrixes. 
The cost function (2.3) becomes 

l{t,x{t),z{t),y{t),q{t),u{t)) = \F{t)x{t)-x{t) + \G{t)z{t)-z{t) 

+ \H{t)y{t) ■ y{t) + \l{t)q{t) ■ q{t) + ij(t)ti(t) ■ u{t), 

and 



X{x) - 


= 0, 


m - 


= 0, 


<P{x) -- 


= i^(t)a;2, 


i{y) = 


= -Mi)y', 



where all function of t are bounded and F{t),G{t),H{t),I{t) are symmet- 
ric non-negative define, J{t),U{t),Q{t) are symmetric uniformly positive 
definite. 

Then from (2.14), the adjoint equations become 

' -dm{t) = -[A{t)m{t) + A'{t)p{t) + A"{t)n{t) + A"'{t)6{t) + F{t)x*{t)]dt 

+ [B{t)m{t) + B'{t)p{t) + B"{t)n{t) + B"'{t)6{t) + G{t)z*{t)]dBt - p{t)dWt, 
m{T) = -U{T)x*{T), 
dn{t) = [C{t)m{t) + C'{t)p{t) + C"{t)n{t) + C"'{t)6{t) + H{t)y*{t)]dt 

+ [D{t)m{t) + D'{t)p{t) + D"{t)n{t) + D"'{t)5{t) + I{t)q*{t)]dWt - 6{t)dBt, 
n(0) = Q{0)y*{0). 
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Define 

H{x,z,y,q,u,m,p,n,6) 
= < F{t,x,z,y,q,u),mt > + < G{t,x, z,y,q,u),p(t) > 
+ < fit, X, z, y, g, u),n{t) > + < g{t, x, z, y, g, u),S{t) > 
+l{t,x,z,y,q,u) 

We have the following result 

Hu{t,x*,z*,y\q*,u*,m,p,n,5) = E{t)m{t)+E'{t)p{t)+E"{t)n{t)+E"'{t)S{t)+J{t)u{t) = 0. 

Thus, 

u*{t) = -J~\t)[E{t)m{t) + E'{t)p{t) + E"{t)n{t) + E"'{t)6{t)]. 

In fact the obtained optimal control u* is not only necessary but also 
sufficient. 

Theorem 3.10. The funUon u* (t) = -J-\t)[E{t)m{t)+E'{t)p{t)+E"{t)n{t) + 
E'"{t)S{t)] is the unique optimal control for backward doubly stochastic LQ 

problems, where {x*{-),z *{-),y*{-),q *(■)) ^'^^ {^'),p{'),''T'{'),^{')) ^f^^ ^o- 
lutions of above equations. 

Proof. (1) We prove that u*{t) is the optimal control. Vv(-) G i?"^'^, let 
{x"" {■) , z'" {■) , y"" {■) , q^ {■)) be the corresponding trajectory of (3.1). We get 

j{v{-y) - j{u*{-)) 

= H / i< F{t)x''{t),x''{t) > - < F{t)x*{t),x*{t) > + < G'(t)z^(t),z^(t) > - < G{t)z*{t),z*{t) > 
Jo 
+ < H{t)y\t),y\t) >-< Hit)y*it),y*it) > + < /(t)g-(t), g'^(t) > - < /(t)g*(t), g*(t) > 

+ < J{t)v{t),v{t) > - < J{t)u*{t),u*{t) >]dt+ < U{T)x\T),x\T) > - < U{T)x*{T),x*{T) > 

+ < Q{0)y-{0),y-{0) >-< Q{0)y*{0),y*{0) >} 

> E{ [<F{t)x*{t),x''{t)-x*{t) > + < G{t)z*{t),z''{t)-z*{t) > + < H{t)y*{t),y''{t)-y*{t) > 
Jo 
+ < /(t)g*(t),g^(t) -g*(t) > + < J{t)u*{t),v{t)-u*{t) >]dt+ < U{T)x*{T),x'"{T)-x*{T) > 

+ <g(0)r(0),?/^(0)-2/*(0)>}. 
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Using lemma 2.1 to < x'"{t) -x*{t),m{t) > + < y'"{t) -y*{t),n{t) > , 
we obtain 

< U{T)x*{T),x-{T)-x*{T) > + < Q{0)y*{0),y^{0)-y*{0) > 

= -E [<F{t)x*{t),x''{t)-x*{t) > + < G{t)z*{t),z%t)-z*{t) > + <H{t)y*{t),y''{t)-y*{t) 
Jo 
+ < I{t)q*{t),q''{t) - q*{t) >< J{t)u*{t),v{t) - u*{t) > + < E{t)m{t),v{t) - u*{t) > 

+ < E'{t)p{t),v{t)-u*{t) > + < E"{t)n{t),v{t)-u*{t) > + < E"'{t)5{t),v{t)-u*{t) >]dt. 
So by the definition of u*{t), 

J{v{-)) - J{u*{.)) 
> < E{t)m{t),v{t) -u*{t) > + < E'{t)p{t),v{t) -u*{t) > + < E"{t)n{t),v{t) - u*{t) > 

+ < E"'{t)5{t),v{t)-u*{t) > + < J{t)u*{t),v{t)-u*{t) >]dt 
= 0. 

Thus, u*{t) is the optimal control. 

(2) Assume that u^{-) and 'U^(-) are both optimal controls, and the cor- 
responding trajectories are (x^(-), z ^{■),y^{-), q ^{■)) and {x'^{-), z ^(■), 2/^(-), q ^(■))- 
By the uniqueness of solutions of (3.11), we know that the trajectory cor- 
responding to ^^^^^^±^ is (^^«±^, ^^^^^, ^^^«±^, ^-^^^^). Since 
J{t),U{t),Q{t) are positive and F{t),G{t), H{t), I{t) are non-negative, we 
have 

J(u\-)) = Jiu\-)) = a>0 

and 

2a = Jiu\-)) + Jiu\-)) 

rp 

> 2j( "^(y" )+2E/ <j(t) "'»/" , "^»/" >(it 

"'0 



'0 



> 2a + lEl \uH-)-uH-)\^dt 



where /3 > 0. So 

eI \u\-)-u'{-)\^dt<0 
Jo 

which shows that u^(-) = u^i-). D 
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Example: Consider the following backward doubly stochastic LQ 
problem, where u{-) G R and n = k = d = l = l. 



-dxt = l{zt + Ut)dWt - ZtdBt, 

xo = 0, 

-dyt = \{qt - Ut)dBt - qtdWt, 

Vt = 0. 
The cost function is 



(3.12) 



j{u{-)) = E rix2(t)+iy2(t)+i(t+i)g2(t)+l(r+2)n2(t)rft+^£;[x^(r)]+l£;[i/2(o)]. 

From (3.12), we get 

xt= 2^^'^'^ Us)dWs - / z.dBs, yt= 2^^^~ '^"^^^^ ^ ^*^^«- 
By substituting x{t) and y{t) into the cost function, we obtain 



1 



1 



J{u{-)) = EJ^ -(t+l)(Q^(t)-«^(t)) + ^(T-t+l)(^^(t)--t.^(t))+-(t+l)t.^(t)rft. 

Thus, the optimal control is u*{t) = with the optimal state trajectory 
{x*{t),z*{t),y*{t),q*{t)) = 0. The adjoint equations are 



-dnit = \ptdBt - PtdWt, 
rriT = 0, 
drit = ^6tdWt - 6tdBt, 



(3.13) 



no 



0. 



It is obvious that (m(t),p(t)) = (0,0) and {n(t),S(t)) = (0,0) are the 
unique solutions of the above equations. 
Moreover, the Hamiltonian is 

Hit, x\ z\y\ q\ u\ m, p, n, (5) = i(T + l^^it). 
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